Models for estimating Bayes factors with applications to phylogeny and tests of monophyly.
Abstract
Bayes factors comparing two or more competing hypotheses are often estimated by constructing a Markov chain Monte Carlo (MCMC) sampler to explore the joint space of the hypotheses. To obtain efficient Bayes factor estimates, Carlin and Chib (1995, Journal of the Royal Statistical Society, Series B 57, 473-484) suggest adjusting the prior odds of the competing hypotheses so that the posterior odds are approximately one, then estimating the Bayes factor by simple division. A byproduct is that one often produces several independent MCMC chains, only one of which is actually used for estimation. We extend this approach to incorporate output from multiple chains by proposing three statistical models. The first assumes independent sampler draws and models the hypothesis indicator function using logistic regression for various choices of the prior odds. The two more complex models relax the independence assumption by allowing for higher-lag dependence within the MCMC output. These models allow us to estimate the uncertainty in our Bayes factor calculation and to fully use several different MCMC chains even when the prior odds of the hypotheses vary from chain to chain. We apply these methods to calculate Bayes factors for tests of monophyly in two phylogenetic examples. The first example explores the relationship of an unknown pathogen to a set of known pathogens. Identification of the unknown's monophyletic relationship may affect antibiotic choice in a clinical setting. The second example focuses on HIV recombination detection. For potential clinical application, these types of analyses must be completed as efficiently as possible.
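As a rough illustration of the first model described in the abstract, the Python sketch below assumes independent sampler draws and pools hypothesis indicators from several chains run at different prior odds. Under that assumption, the log posterior odds in each chain equal the log Bayes factor plus that chain's log prior odds, so an intercept-only logistic regression with the log prior odds as an offset estimates the log Bayes factor. The function names, the simulated data, and the damped Newton solver are illustrative assumptions, not the authors' implementation.

```python
import math
import random


def sigmoid(x):
    """Logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def simulate_chain(true_log_bf, log_prior_odds, n, rng):
    """Hypothetical helper: draw n independent 0/1 hypothesis indicators.

    Each draw equals 1 with probability sigmoid(log BF + log prior odds),
    i.e. the posterior probability of hypothesis 1 under those prior odds.
    """
    p = sigmoid(true_log_bf + log_prior_odds)
    return [1 if rng.random() < p else 0 for _ in range(n)]


def estimate_log_bf(chains, max_iter=100, tol=1e-10):
    """Estimate the log Bayes factor from several chains.

    chains: list of (log_prior_odds, indicator_draws) pairs.
    Fits logit P(indicator = 1) = b + log_prior_odds by maximum likelihood,
    where the intercept b is the log Bayes factor. Returns (b, std_error).
    Assumes independent draws and at least one 0 and one 1 overall.
    """
    b = 0.0
    for _ in range(max_iter):
        score = 0.0  # first derivative of the log-likelihood in b
        info = 0.0   # Fisher information for b
        for offset, draws in chains:
            p = sigmoid(b + offset)
            score += sum(draws) - len(draws) * p
            info += len(draws) * p * (1.0 - p)
        # Newton step, clamped to +/-2 to damp early overshoot.
        step = max(-2.0, min(2.0, score / info))
        b += step
        if abs(step) < tol:
            break
    info = sum(
        len(draws) * sigmoid(b + offset) * (1.0 - sigmoid(b + offset))
        for offset, draws in chains
    )
    return b, 1.0 / math.sqrt(info)


# Usage: three simulated chains whose prior odds roughly offset a log BF of 2.
rng = random.Random(0)
demo_chains = [(o, simulate_chain(2.0, o, 5000, rng)) for o in (-2.5, -2.0, -1.5)]
log_bf, se = estimate_log_bf(demo_chains)
print(f"estimated log Bayes factor: {log_bf:.3f} (s.e. {se:.3f})")
```

Because real MCMC output is autocorrelated, the standard error from this independence sketch would typically be too small; that is the situation the abstract's two more complex models, which allow higher-lag dependence, are meant to address.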
Similar resources
Molecular Phylogeny of the Puntius (Hamilton, 1822) Based on Nuclear Gene RAG2
The tropical Asian cyprinid genus Puntius is a major part of the ichthyofauna in Southeast Asia. Systematic status of the genus Puntius among Cyprinidae, the most prominent freshwater fish all over the world, remains to be substantiated. The molecular phylogenetic analyses derived from Recombination activating gene sequences (RAG2) for 35 representative samples of Malaysian Puntius and their alli...
Testing monophyly without well-supported gene trees: evidence from multi-locus nuclear data conflicts with existing taxonomy in the snake tribe Thamnophiini.
Ideally, existing taxonomy would be consistent with phylogenetic estimates derived from rigorously analyzed data using appropriate methods. We present a multi-locus molecular analysis of the relationships among nine genera in the North American snake tribe Thamnophiini in order to test the monophyly of the crayfish snakes (genus Regina) and the earth snakes (genus Virginia). Sequence data from ...
Empirical Bayes Estimators with Uncertainty Measures for NEF-QVF Populations
The paper proposes empirical Bayes (EB) estimators for simultaneous estimation of means in the natural exponential family (NEF) with quadratic variance functions (QVF) models. Morris (1982, 1983a) characterized the NEF-QVF distributions which include among others the binomial, Poisson and normal distributions. In addition to the EB estimators, we provide approximations to the MSE’s of t...
Subgeneric classification of Linaria (Plantaginaceae; Antirrhineae): molecular phylogeny and morphology revisited
Linaria Mill. (Plantaginaceae) with about 160 spp. is the largest genus of the tribe Antirrhineae. We conducted phylogenetic analyses of nuclear ribosomal DNA internal transcribed spacer region (ITS) and chloroplast DNA (rpl32-trnL) sequence data to test the monophyly of currently recognized sections in Linaria. For this purpose 86 species representing seven sections of Linaria and one species ...
Estimating the Safety Benefits of Red Light Cameras at Signalized Intersections in Urban Areas Case Study: The City of Virginia Beach
The Highway Safety Manual [HSM, 2010] recommends safety evaluations be performed before implementing any roadway treatment to predict the expected safety consequences. Safety consequences can be measured using crash prediction models, Crash Modification Factor (CMFs), or both. This paper develops a CMF to show the expected impact of red-light cameras (RLCs) on safety at signalized intersections...
Journal:
- Biometrics
Volume 61, Issue 3
Pages: -
Publication date: 2005